Listing of Claims:

1. (currently amended) A method for determining similarity
between portions of gene expression profiles in a computer
comprising the steps of:

processing a number of gene expression profiles with a similar
sequences algorithm that is a time and intensity invariant
correlation function to obtain a data set of gene expression
profile pairs and a match fraction for each gene expression
profile pair;

listing gene expression profile pairs in clusters by their match
fractions;

removing a first gene expression profile from a cluster when
another cluster has another gene expression profile with a higher
match fraction with the first gene expression profile, unless the
another gene expression profile requires a larger number of
subsequences to achieve similarity with the first gene expression
profile;

repeating the removing step until all gene expression profiles 
are listed in only one cluster; 

providing output of the listing of clusters of gene
expression profiles to a user.
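
The removing and repeating steps of claim 1 can be read as a small loop. The Python sketch below is one interpretation, not the applicant's implementation: the names `pair`, `best_pair`, `resolve`, and the dictionaries `match` and `subseq` (keyed by profile pair) are assumptions for illustration, and the similar sequences algorithm that would produce them is not shown. A profile found in two clusters leaves the cluster with the lower match fraction, unless the higher-fraction pair needs more subsequences, in which case it leaves that cluster instead.

```python
def pair(a, b):
    """Order-independent key for a profile pair (hypothetical helper)."""
    return (a, b) if a <= b else (b, a)


def best_pair(profile, cluster, match, subseq):
    """(match fraction, subsequence count) of the profile's strongest pair in a cluster."""
    scored = [
        (match[pair(profile, other)], subseq[pair(profile, other)])
        for other in cluster
        if other != profile and pair(profile, other) in match
    ]
    # A profile with no partner left scores lowest and never wins on subsequences.
    return max(scored, key=lambda s: s[0]) if scored else (0.0, float("inf"))


def resolve(clusters, match, subseq):
    """Repeat the removing step until every profile is listed in only one cluster."""
    clusters = [set(c) for c in clusters]
    for profile in sorted(set().union(*clusters)):
        homes = [i for i, c in enumerate(clusters) if profile in c]
        while len(homes) > 1:
            a, b = homes[0], homes[1]
            fa, sa = best_pair(profile, clusters[a], match, subseq)
            fb, sb = best_pair(profile, clusters[b], match, subseq)
            hi, lo = (a, b) if fa >= fb else (b, a)
            s_hi, s_lo = (sa, sb) if fa >= fb else (sb, sa)
            # Assumed reading of the "unless" clause: the higher-fraction pair
            # loses when it requires more subsequences than the other pair.
            loser = hi if s_hi > s_lo else lo
            clusters[loser].discard(profile)
            homes.remove(loser)
    return clusters
```

The order in which profiles are visited can affect the result, because each removal changes the best pair available in a cluster; the claim does not fix an order, so sorting by name here is arbitrary.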

2. (currently amended) A method for determining similarity
between portions of gene expression profiles comprising the steps 
of: 

processing a number of gene expression profiles with a similar
sequences algorithm that is a time and intensity invariant
correlation function to obtain a data set of gene expression
pairs and a match fraction for each pair;

listing gene expression pairs in clusters by their match
fractions;

removing a first gene from a first cluster when the first gene
is also in a second cluster which has another gene with a higher
match fraction with the first gene than any of the genes in the
first cluster have with the first gene, but;

retaining the first gene in the first cluster and removing the
first gene from the second cluster when the difference between
the highest match fraction of the first gene with a gene in the
first cluster and the highest match fraction of the first gene
with a gene in the second cluster is less than a minimum
difference threshold and the number of subsequences represented
in the similar gene pair having the highest match fraction in the
first cluster is higher than the number of subsequences
represented in the similar gene pair having the highest match
fraction in the second cluster;

repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene
expression profiles to a user.
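
The removing and retaining steps of claim 2 reduce to a decision about which cluster the first gene leaves. The sketch below follows the wording of this claim only; the function name `cluster_to_leave`, its tuple inputs, and the parameter `min_diff` (the minimum difference threshold) are hypothetical names chosen for illustration.

```python
def cluster_to_leave(first, second, min_diff):
    """Decide which cluster a gene listed in two clusters should leave.

    first, second: (highest match fraction, subsequence count) for the gene's
    best pair in the first and second cluster. Returns "first", "second", or
    None when the removing step as claimed does not apply.
    """
    f1, s1 = first
    f2, s2 = second
    if f2 <= f1:
        # The claim only addresses a second cluster with the higher match fraction.
        return None
    if (f2 - f1) < min_diff and s1 > s2:
        # Retaining step: fractions nearly tie and the first cluster's pair
        # represents more subsequences, so the gene stays in the first cluster.
        return "second"
    # Removing step: the second cluster has the clearly better match.
    return "first"
```

For example, with a minimum difference threshold of 0.05, a gene whose best pairs score (0.80, 4 subsequences) in the first cluster and (0.82, 2 subsequences) in the second is retained in the first cluster and removed from the second.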

3. (currently amended) A method of determining similarity
between portions of gene expression profiles comprising the steps
of: 

processing data embodying a number of gene expression profiles
with a similar sequences algorithm in a computer that is a time
and intensity invariant correlation function to obtain a data set
of gene expression pairs and a match fraction for each pair;

choosing a threshold match fraction;

listing gene expression pairs in clusters by their match
fractions above the threshold;

adding each gene not already in a cluster to a cluster having 
another gene having a highest match fraction with the each gene 
without regard of the threshold; 

removing a first gene from a cluster when the first gene is also
in another cluster which has another gene with a higher match
fraction with the first gene than any of the genes in the cluster
have with the first gene;

repeating the removing step until all genes are listed in only
one cluster;

providing output of the listing of clusters of gene
expression profiles to a user.
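
The adding step of claim 3 places each gene left out by the threshold into the cluster holding its best-matching gene. The sketch below is an illustration under assumptions: `add_unclustered` and the `match` dictionary keyed by `frozenset` gene pairs are hypothetical, and the search looks only at the clusters as they stood before any gene was added, which the claim does not specify.

```python
def add_unclustered(genes, clusters, match):
    """Add each gene not already in a cluster to the cluster of its best match."""
    original = [frozenset(c) for c in clusters]   # snapshot used for the search
    result = [set(c) for c in clusters]
    clustered = set().union(*original)
    for gene in genes:
        if gene in clustered:
            continue
        # The threshold is ignored here; a gene with no positive match fraction
        # to any clustered gene is left unclustered in this sketch.
        best_frac, best_idx = 0.0, None
        for idx, cluster in enumerate(original):
            for other in cluster:
                frac = match.get(frozenset((gene, other)), 0.0)
                if frac > best_frac:
                    best_frac, best_idx = frac, idx
        if best_idx is not None:
            result[best_idx].add(gene)
    return result
```

A gene `e` whose best match fraction is 0.3 with a member of the second cluster joins that cluster even when the chosen threshold is higher than 0.3.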

4. (currently amended) A method for determining similarity
between portions of gene expression profiles comprising the steps 
of: 

processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function with a computer to obtain a data set of gene 
expression pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

listing gene expression pairs in clusters by their match 
fractions above the threshold; 

adding each gene not already in a cluster to a cluster having 
another gene having a highest match fraction disregarding the 
threshold with the each gene; 

removing a first gene from a first cluster when the first gene 
is also in a second cluster which has another gene with a higher 
match fraction with the first gene than any of the genes in the 
first cluster have with the first gene, but; 

retaining the first gene in the first cluster and removing the
first gene from the second cluster when the difference between
the highest match fraction of the first gene with a gene in the
first cluster and the highest match fraction of the first gene
with a gene in the second cluster is less than a minimum
difference threshold and the number of subsequences represented
in the similar gene pair having the highest match fraction in the
first cluster is higher than the number of subsequences
represented in the similar gene pair having the highest match
fraction in the second cluster;

repeating the removing and retaining steps until all genes are 
listed in only one cluster; 

providing output of the listing of clusters of gene
expression profiles to a user.

5. (currently amended) A method in a computer for determining
similarity between genes comprising the steps of:

listing genes to be compared in a data set by their gene
expression profiles;

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

creating a set G in which to list indices of genes accounted
for;

assigning genes i and j to a cluster a if they have a match
fraction greater than the threshold;

assigning gene k to the cluster a if it has a match fraction
greater than the threshold with either gene i or gene j;

assigning genes k and l to a cluster b if they have a match
fraction greater than the threshold and if both gene k and gene l
do not have match fractions above the threshold with either gene
i or gene j;

repeating the assigning steps until all genes to be compared 
have been considered; 

removing a first gene from a cluster when another cluster has
another gene with a higher match fraction with the first gene;

repeating the removing step until all genes are listed in only
one cluster;

providing output of the listing of clusters of gene
expression profiles to a user.
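
The assigning steps of claim 5 can be sketched as a pass over the gene pairs whose match fraction exceeds the threshold. This is a hedged illustration, not the claimed method itself: `assign_clusters`, the tuple-keyed `match` dictionary, and the choice to visit pairs in name order are assumptions, and `accounted` stands in for the set G. A pair starts a new cluster only when neither gene matches a seed pair of an existing cluster above the threshold.

```python
def assign_clusters(match, threshold):
    """Group genes by above-threshold match fractions with each cluster's seed pair."""
    strong = {frozenset(p) for p, frac in match.items() if frac > threshold}

    def linked(a, b):
        return frozenset((a, b)) in strong

    accounted = set()  # plays the role of the set G (genes accounted for)
    clusters = []      # each entry: (seed pair (i, j), member set)
    for i, j in sorted(tuple(sorted(p)) for p in strong):
        placed = False
        for (si, sj), members in clusters:
            for gene in (i, j):
                # Gene joins a cluster when it matches either seed gene.
                if linked(gene, si) or linked(gene, sj):
                    members.add(gene)
                    accounted.add(gene)
                    placed = True
        if not placed:
            # Neither gene matches an existing seed pair: start a new cluster.
            clusters.append(((i, j), {i, j}))
            accounted.update((i, j))
    return [members for _, members in clusters], accounted
```

A gene may land in more than one cluster under this scheme, which is why the removing and repeating steps follow. A gene whose only above-threshold partner is a non-seed member is not placed by this sketch; the claim leaves that case open.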

6. (currently amended) A method in a computer for determining
similarity between genes comprising the steps of:

listing genes to be compared in a data set by their gene
expression profiles;

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 
choosing a threshold match fraction;

creating a set G in which to list indices of genes accounted
for;

assigning genes i and j to cluster 1 if they have a match
fraction greater than the threshold;

assigning gene k to cluster 1 if it has a match fraction greater
than the threshold with either gene i or gene j;

assigning genes k and l to cluster 2 if they have a match
fraction greater than the threshold and if both gene k and gene l
do not have match fractions above the threshold with either gene
i or gene j;

removing a first gene from a cluster when another cluster has
another gene with a higher match fraction with the first gene,
unless the another gene requires a larger number of subsequences
to achieve similarity with the first gene;

repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene
expression profiles to a user.

7. (canceled)
8. (canceled)
9. (canceled) 

10. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising:

programmed means for processing a number of gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for listing gene expression pairs in clusters
by their match fractions;

programmed means for removing a first gene from a cluster when
the first gene is also in another cluster which has another gene
with a higher match fraction with the first gene than any of the
genes in the cluster have with the first gene;

programmed means for repeating the removing step until all genes
are listed in only one cluster.

11. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles using
output from a similar sequences algorithm that is a time and
intensity invariant correlation function comprising:

programmed means for providing a gene expression profile data
set as input to programmed means embodying a similar sequences
algorithm that is a time and intensity invariant correlation
function to obtain a data set of gene expression pairs and a
match fraction for each pair as output from the programmed means
embodying a similar sequences algorithm;

programmed means for listing the gene expression pairs in 
clusters by their match fractions; 

programmed means for removing a first gene from a cluster when
the first gene is also in another cluster which has another gene
with a higher match fraction with the first gene than any of the
genes in the cluster have with the first gene;

programmed means for repeating the removing step until all genes
are listed in only one cluster.



CHA9 2003 0003 US1 
Serial No: 10/629,448 
Filed: July 29, 2003 




12. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising the steps of:

programmed means for processing a number of gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for listing gene expression pairs in clusters
by their match fractions;

programmed means for removing a first gene from a first cluster
when the first gene is also in a second cluster which has another
gene with a higher match fraction with the first gene than any of
the genes in the first cluster have with the first gene, but;

programmed means for retaining the first gene in the first
cluster and removing the first gene from the second cluster when
the difference between the highest match fraction of the first
gene with a gene in the first cluster and the highest match
fraction of the first gene with a gene in the second cluster is
less than a minimum difference threshold and the number of
subsequences represented in the similar gene pair having the
highest match fraction in the first cluster is higher than the
number of subsequences represented in the similar gene pair
having the highest match fraction in the second cluster;

programmed means for repeating the removing step until all genes
are listed in only one cluster.

13. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising the steps of:

programmed means for processing a number of gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for choosing a threshold match fraction;

programmed means for listing gene expression pairs in clusters
by their match fractions above the threshold;

programmed means for adding each gene not already in a cluster 
to a cluster having another gene having a highest match fraction 
with the each gene without regard of the threshold; 
programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another gene 
with a higher match fraction with the first gene than any of the 
genes in the cluster have with the first gene; 

programmed means for repeating the removing step until all genes
are listed in only one cluster.

14. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising the steps of:

programmed means for processing a number of gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for choosing a threshold match fraction;

programmed means for listing gene expression pairs in clusters
by their match fractions above the threshold;

programmed means for adding each gene not already in a cluster
to a cluster having another gene having a highest match fraction
disregarding the threshold with the each gene;

programmed means for removing a first gene from a first cluster
when the first gene is also in a second cluster which has another
gene with a higher match fraction with the first gene than any of
the genes in the first cluster have with the first gene, but;

programmed means for retaining the first gene in the first
cluster and removing the first gene from the second cluster when
the difference between the highest match fraction of the first
gene with a gene in the first cluster and the highest match
fraction of the first gene with a gene in the second cluster is
less than a minimum difference threshold and the number of
subsequences represented in the similar gene pair having the
highest match fraction in the first cluster is higher than the
number of subsequences represented in the similar gene pair
having the highest match fraction in the second cluster;

programmed means for repeating the removing and retaining steps
until all genes are listed in only one cluster.

15. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between genes comprising the steps of:

programmed means for listing genes to be compared by their gene
expression profiles;

programmed means for processing the listed gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for choosing a threshold match fraction;

programmed means for creating a null set G(0) to hold genes
accounted for;

programmed means for assigning genes i and j to cluster 1 if
they have a match fraction greater than the threshold;

programmed means for assigning gene k to cluster 1 if it has a
match fraction greater than the threshold with either gene i or
gene j;

programmed means for assigning genes k and l to cluster 2 if
they have a match fraction greater than the threshold and if both
gene k and gene l do not have match fractions above the threshold
with either gene i or gene j;

programmed means for removing a first gene from a cluster when 
another cluster has another gene with a higher match fraction 
with the first gene; 

programmed means for repeating the removing step until all genes
are listed in only one cluster.

16. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between genes comprising the steps of:

programmed means for listing genes to be compared by their gene
expression profiles;

programmed means for processing the listed gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set of
gene expression pairs and a match fraction for each pair;

programmed means for choosing a threshold match fraction;

programmed means for creating a null set G(0) to hold genes
accounted for;

programmed means for assigning genes i and j to cluster 1 if
they have a match fraction greater than the threshold;

programmed means for assigning gene k to cluster 1 if it has a
match fraction greater than the threshold with either gene i or
gene j;

programmed means for assigning genes k and l to cluster 2 if
they have a match fraction greater than the threshold and if both
gene k and gene l do not have match fractions above the threshold
with either gene i or gene j;

programmed means for removing a first gene from a cluster when
another cluster has another gene with a higher match fraction
with the first gene, unless the another gene requires a larger
number of subsequences to achieve similarity with the first gene;

programmed means for repeating the removing step until all genes
are listed in only one cluster.

17. (canceled)
18. (canceled)
19. (canceled)



20. (currently amended) In a method of determining similarity 
between portions of gene expression profiles which includes 
processing a number of gene expression profiles using a computer 
with a similar sequences algorithm that is a time and intensity 
invariant correlation function to obtain a data set of gene 
expression pairs and a match fraction for each pair, the
improvement comprising the steps of:

listing gene expression pairs in clusters by their match 
fractions; 

removing a first gene from a cluster when another cluster has
another gene with a higher match fraction with the first gene,
unless the another gene requires a larger number of subsequences
to achieve similarity with the first gene;

repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene
expression profiles to a user.